Picture for Yu Sun

Yu Sun

Sherman

Super-Resolution and Denoising of Corneal B-Scan OCT Imaging Using Diffusion Model Plug-and-Play Priors

Add code
Feb 02, 2026
Viaarxiv icon

End-to-end reconstruction of OCT optical properties and speckle-reduced structural intensity via physics-based learning

Add code
Feb 02, 2026
Viaarxiv icon

WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models

Add code
Jan 29, 2026
Viaarxiv icon

Accurate Network Traffic Matrix Prediction via LEAD: an LLM-Enhanced Adapter-Based Conditional Diffusion Model

Add code
Jan 29, 2026
Viaarxiv icon

Open-Vocabulary Functional 3D Human-Scene Interaction Generation

Add code
Jan 28, 2026
Viaarxiv icon

CORD: Bridging the Audio-Text Reasoning Gap via Weighted On-policy Cross-modal Distillation

Add code
Jan 23, 2026
Viaarxiv icon

Learning to Discover at Test Time

Add code
Jan 22, 2026
Viaarxiv icon

VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction

Add code
Jan 09, 2026
Viaarxiv icon

MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free

Add code
Jan 08, 2026
Viaarxiv icon

FROST-Drive: Scalable and Efficient End-to-End Driving with a Frozen Vision Encoder

Add code
Jan 06, 2026
Viaarxiv icon